Variance-Based Exploration for Learning Model Predictive Control

نویسندگان

چکیده

The combination of model predictive control (MPC) and learning methods has been gaining increasing attention as a tool to systems that may be difficult model. Using MPC function approximator in reinforcement (RL) is one approach reduce the reliance on accurate models. RL dependent exploration learn, currently, simple heuristics based random perturbations are most common. This paper considers variance-based geared towards using approximator. We propose use non-probabilistic measure uncertainty value value-based methods. Uncertainty measured by variance estimate inverse distance weighting (IDW). IDW framework computationally cheap evaluate therefore well-suited an online setting, already sampled state transitions rewards. gradient then used perturb policy parameters direction where increasing. proposed method verified two simulation examples, considering both linear nonlinear system dynamics, compared standard perturbations.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that provides provable high-probability safety gua...

متن کامل

Linear model predictive safety certification for learning-based control

While it has been repeatedly shown that learningbased controllers can provide superior performance, they often lack of safety guarantees. This paper aims at addressing this problem by introducing a model predictive safety certification (MPSC) scheme for polytopic linear systems with additive disturbances. The scheme verifies safety of a proposed learningbased input and modifies it as little as ...

متن کامل

Learning-based model predictive control for Markov decision processes

We propose the use of Model Predictive Control (MPC) for controlling systems described by Markov decision processes. First, we consider a straightforward MPC algorithm for Markov decision processes. Then, we propose value functions, a means to deal with issues arising in conventional MPC, e.g., computational requirements and sub-optimality of actions. We use reinforcement learning to let an MPC...

متن کامل

Learning Model Predictive Control for Iterative Tasks

A Learning Model Predictive Controller (LMPC) for iterative tasks is presented. The controller is referencefree and is able to improve its performance by learning from previous iterations. A safe set and a terminal cost function are used in order to guarantee recursive feasibility and nonincreasing performance at each iteration. The paper presents the control design approach, and shows how to r...

متن کامل

Model-Based Predictive Control

MBPC is a feedback-control methodology suitable to enforce efficiently hard constraints on the variables of the controlled system. It is shown that the method hinges upon a constrained open-loop optimal control problem along with the adoption of the so-called receding-horizon control strategy. In the important case of time-invariant linear saturated ANCBI systems, MBPC algorithms can be devised...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3282842